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In humans over 15% of X-linked genes have been shown to 
escape from X-chromosome inactivation (XCI): they con- 
tinue to be expressed to some extent from the inactive X 
chromosome. Mono-allelic expression is anticipated within a 
cell for genes subject to XCI, but random XCI usually results 
in expression of both alleles in a cell population. Using a 
study of allelic expression from cultured lymphoblasts and 
fibroblasts, many of which showed substantial skewing of 
XCI, we recently reported that the expression of genes lies 
on a contiunuum between those that are subject to 
inactivation, and those that escape. We now review allelic 
expression studies from mouse, and discuss the variability in 
escape seen in both humans and mice in genie expression 
levels, between X chromosomes and between tissues. We 
also discuss current knowledge of the heterochromatic 
features, DNA elements and three-dimensional topology of 
the inactive X that contribute to the balance of expression 
from the otherwise inactive X chromosome. 
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Introduction 

The inactivation of almost a thousand genes on one of the 
two essentially identical X chromosomes in female nuclei is 
perhaps the most striking example known of epigenetic gene 
silencing; however, not all of the genes on the inactive X 
chromosome (Xi) are silenced. In humans, approximately 15% 
of X-linked genes escape from X-chromosome inactivation 
(XCI) and are likely principal contributors to the phenotypes of 
humans with X-chromosome aneuploidies. In mice, a smaller 
percentage (perhaps 3%) of genes escape from XCI, consistent 
with the viability and limited phenotypic consequences for 
female sex chromosome monosomy (reviewed in [1]). It has, 
however, been argued that only a subset of X-linked genes, in 
particular those that are involved in large protein complexes, 
need to be dosage compensated [2]; and similarly, a lack of 
dosage compensation for those genes that escape from XCI 
may only have phenotypic consequences for a subset of genes. 
In general, there is strong synteny of the content of the X 
chromosome amongst eutheria, although the pseudoautoso- 
mal regions (PARs), which continue to recombine between 
the X and Y chromosomes, differ between species [3]. To date, 
it is not known why human and mouse differ so substantially 
in the number of genes that escape from XCI. 

Escape from XCI is not an absolute with either full or no 
expression from the Xi; rather, in both humans and mice, 
variability has been shown in the extent of expression, as well 
as differences between X chromosomes and between tissues as 
to which genes are expressed. Therefore, escape from XCI will 
not only result in differences in expression between males and 
females, but also between tissues and between females, which 
could have important implications for disease predispositions 
between men and women, or amongst women. Furthermore, 
XCI has long served as a paradigm for heterochromatin 
formation, and understanding how genes escape from XCI can 
inform our understanding of how silencing spreads, and how 
certain regions of the X chromosome evade inactivation. 

The Xi is facultative heterochromatin; as in the case of 
classical position effect variegation, the dogma has been that 
inactivation spreads along the chromosome. The ability of 
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silencing to spread incompletely into autosomal material 
translocated to the X chromosome led to the description 
of an X inactivation centre (XIC) from which inactivation 
initiated, and waystations - booster elements that support 
and extend the capacity for spread along the X chromo- 
some [4]. A candidate for the initiating factor in the XIC is the 
long non-coding RNA XIST, and long interspersed nuclear 
elements (LlNEs) have been proposed as potential way- 
stations. Characterisation of cis-regulatory elements in 
humans has been hindered by the lack of a tractable 
developmental model system, as there is considerable 
epigenetic instability in human female embryonic stem (ES) 
cells that have often already undergone XCl (e.g. [5]). On the 
other hand, in female mouse ES cells, one X chromosome 
undergoes XCl during differentiation [6], thus providing a 
system to study XCl in culture as well as in mouse models. 

The fundamental mechanisms of XCl and the marks of 
an Xi appear to be very similar between mice and humans. 
However, in addition to the differing number of genes that 
escape from XCl, there are a few key divergences in the XIC 
regions, including the regulatory function of the Xist 
antagoniser Tsix [7, 8], and the timing of XCl and presence 
of imprinted XCl in extraembryonic tissues (reviewed in [9]). 
Furthermore, another potential source of variability is the long 
non-coding RNA termed XACT, which transiently coats the 
active X chromosome (Xa) but has only been identified in 
humans and chimpanzees [10]. Despite these differences, the 
success of using human XIST transgenes to recapitulate XCl in 
the mouse [11, 12], and the recent demonstration that escape 
from XCl of human genes is possible from the mouse X 
chromosome [13], have illustrated the importance of manipu- 
lation of mouse models for understanding human XCl. The 
ability to follow the initial steps of XCl in mouse has been 
crucial in identifying the cascade of events during XCl, 
including recent examinations of the chromosome topology. 
These have revealed new insights into Xi chromatin and 
epigenetic interactions, an important step towards under- 
standing the process of silencing - and escape. 



Human genes that escape from XCl 
correlate with evolutionary history of the 
sex chromosomes 

XCl achieves dosage equivalence between the two X 
chromosomes in females and the single X in males, and thus 
the need for XCl is believed to have been driven by the loss of 
genes from the Y chromosome, which also provided pressure 
to up-regulate expression levels from the Xa [14]. Genes in the 
PAR are prime examples of genes that escape from XCl [15]; 
however, evidence that a significant portion of human 
X-linked genes might escape from XCl accumulated as more 
genes outside the PARI were identified that were expressed 
from the Xi (e.g. STS [16]). In 2005, Carrel and Willard 
published the first large 'survey' of the XCl status of 634 
X-linked human transcripts using two different methods [17]. 
The first method relied upon rodent/human somatic cell 
hybrids that retained either a human Xa or Xi. The second 
method examined the expression of polymorphisms from each 



X chromosome in human cell lines that had non-random or 
clonal XCl. In a female with clonal XCl, a gene that is subject 
to XCl will have mono-allelic expression whereas a gene that 
escapes from XCl will show bi-allelic expression proportional 
to the degree of escape. Cell lines with clonal XCl were 
obtained by examining fibroblasts from females with 
structurally abnormal X chromosomes, although other studies 
have used carriers of X-linked disease, selection, or clonal 
expansion from a single cell to achieve the same purpose. 
Escape from XCl and the level of Xi expression was shown to 
be strongly correlated with X and Y chromosome homology 
that in turn is related to the length of time since the genes on 
the X and Y chromosomes diverged [3, 17, 18]. 

Mouse RNA-seq studies demonstrate 
variability in escape from XCl 

The growing plethora of genome-wide studies has provided 
further opportunities to identify genes that escape from XCl. 
Mouse RNA-seq studies in cells with non-random XCl have 
validated that mice have fewer genes that escape from XCl than 
humans and have laid the foundation for expanding RNA-seq 
analysis into humans [19-22]. As shown in Table 1, 31 X-linked 
mouse genes have shown some evidence for escape from XCl 
(defined as either greater than 10% expression from the Xi or 
bi-allelic expression) in brain, neural stem cell and/or kidney 
cell lines. Thirteen genes (highlighted in bold in Table 1) 
showed escape from XCl across the majority of studies. 
Variation between studies may be caused by several factors 
including: differences in genie Xi expression levels; differences 
between X chromosomes, differences between tissues and/or 
differences between females (illustrated in Fig. 1). 

A continuum of expression from the Xi, from zero to a 
maximum of approximately 70% of the level seen from the 
Xa was observed in humans [23], and similarly in Table 1, 
expression levels varied from the Xi in mice. It is not known 
whether all cells had Xi expression at the same level, or 
whether only a subset of cells showed full Xi expression while 
other cells showed none. Previous studies using RNA FISH 
suggested that even a consistent gene that escapes from XCl, 
KdmSc, only showed expression from both X chromosomes in 
a small subset of cells [24]; while single-cell RT-PCR analysis 
of other genes in humans has demonstrated that expression 
from the Xi was present in all cells but highly varied [25]. 

These mouse RNA-seq studies used interspecies crosses 
to maximise the number of polymorphisms that could be 
examined, so it was possible to identify strain-specific escape 
from XCl, which is an example of the inter-chromosomal 
variability observed in escape from XCl between females 
(Fig. IC). Seven mouse genes showed evidence for strain-specific 
escape, suggesting that escape from XCl may be influenced 
by features of the X chromosome present in one strain but 
not the other. In humans. Carrel and Willard [17] observed 
variability between females, and our survey suggested that 
approximately 13%o of genes would variably escape in some but 
not all females and that an additional 10% of genes may show 
variable escape due to population differences in XCl [23]. 

Unlike the other mouse RNA-seq studies that examined 
somatic tissues, Calabrese et al. [22] examined trophoblast 
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Table 1. Summary of mouse genes that escape from XCI based on RNA-seq data 



Mouse 
gene name 


Brain 
cells [19] 


Neural 

stem cells [21] 


Embryonic 
kidney cells [20] 


Trophoblast 
stem cells [22] 


Human 
gene name 


Hybrid 
escape [17] 


ShroomA 


- 


0% 


ggo/^musc. 


1% 


SHROOMA 


0/9 


CIcnS 


7% 


0% 


- 


0% 


CLCN5 


0/9 


Syp 


5% 


0% 


- 


- 


SYP 


2/9 


TimmUb 


28% 


0% 


- 


1% 


TIMM17B 


0/9 ^ 


Gm4984 




- 


- 


- 


- 




1810030O07Rik 


3% 


0% 


13%'""^"- 


0% 


CXorf38 


8/9 


Ddx3x 


20% 


0% 


71 %'""^'=- 


- 


DDX3X 


9/9 ^ 




46% 


75% 




9% 


KDM6A 


9/9 H 


Utp14a 


10% 


50% 


- 


16% 


UTP1AA 


3/9 ^ 


6720401 G13Rik 


- 


0% 


130%'""^''- 


- 


- 




Cdr1 


9% 


0% 


- 


- 


- 




Hmgb3 


4% 


0% 


- 


0% 


HMGB3 


0/9 


Bgn 


6% 


0% 


25%'""^''- 


- 


BGN 


0/6 ^ 


Eif2s3x 


51% 


100% 


ygo/^musc. 


28% 


EIF2S3 




Med12 


4% 


0% 


1% 


- 


MED12 


0/9 ^ 


Taf1 


6% 


0% 


0% 


37% 


TAF1 


0/5 


Chid 


5% 


0% 


- 


- 


CHIC1 


0/9 J 


Enox^ 


27% 


- 


— 


46% 


JPX 




5530601H04Rik 


31% 


100% 


— 




- 




2610029G23Rik 


- 


50% 




11%'""^=- 


CXorf26 


0/9 ■ 


Itm2a 




0% 


— 


5% 


ITM2A 


5/9 J 


Wbp5 




0% 


0% 




WBP5 




Gnl3l 


6% 






0% 


GNL3L 


0/9 ^ 


Hsd17b10 


3% 


0% 


3% 


0% 


HSF17B10 


t/9 H 


KdmSc'' 


29% 


75% 


43%'""^"- 


16% 


KDM5C 


9/9 ^ 


Sat1 


8% 


0% 




0% 


SAT1 




CarSb 




0% 






CA5B 


9/9 ^ 


Trappc2 


3% 


0% 




6% 


TRAPPC2 


9/9 H 


BC022960 






gQO/^musc. 








Midi 


52% 




184%'""^''- 




MIDI 


1/9 ■ 


G530011O06Rik 




33% 










Trophoblast 








8 escape 






specific escape 








7 escape'""^"- 
1 escape"^'" 






Total genes 


263 


268 


135 


369 






examined 















All studies examined both reciprocal crosses except for Yang et al. [20] in which only the B6 X chromosome was the Xi. The average percent 
Xi expression relative to the Xa is given for each examined gene in Wu et al. [19], Yang et al. [20] and Calabrese et al. [22]. For Li et al. [21], 
the percent of bi-allelic cell lines is given. Genes highlighted in bold suggest escape from XCI in the majority of studies. Superscripts of 'muse' 
and 'casf.' indicate that escape from XCI was only observed in one mouse strain. Corresponding human homologues and the XCI status 
established in Carrel and Willard [17] are shown to the far right. 
^KdmSa is also known as Utx. 
'°Enox is also known as Jpx and 2010000l03Rik. 
''KdmSc is also known as Jaridic and Smcx. 



stem cells, which showed evidence for escape from XCI for an 
additional 22 genes. The higher degree of escape from XCI in 
trophoblast stem cells could be considered an extreme 
example of differences in XCI based on tissue (Fig. ID), but 
also raises the question of whether escape from XCI is truly 
escape or if escape is rather reactivation; trophoblast has been 
reported to be more epigenetically plastic [26]. In the case of 
KdmSc, there is initial silencing, but then rapid reactivation on 
the Xi [27], whereas Otc has been shown to reactivate slowly 
with age [28]. In humans, approximately 9% of genes show 
tissue-specific XCI [23], but little is known about early, or age- 
related reactivation. 



Caveats in defining tlie variability in 
escape from XCI 

Escape from XCI is also known to differ between females as a 
result of the above differences between X chromosomes and 
tissues, but with the added complexity of skewing of XCI 
(Fig. IE). While mouse crosses can be undertaken with 
particular alleles to bias XCI towards the inactivation of a 
specific X chromosome (e.g. [29]), most human females have 
random XCI in all tissues including blood [30]. While new high 
throughput methodologies allow for the XCI status of more 
genes to be examined in an increasing number of tissues. 
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Figure 1. Variability in dosage compensation on the X chromo- 
some. A: The X and Y chromosomes evolved from an ancestral 
pair of autosomes. After their divergence, expression of X-linked 
genes from the Xa was increased, and the majority of genes 
were silenced on the Xi. Note that only a subset of genes is 
illustrated in all figures. B: Not all genes that escape from XCI 
show the same level of expression. Longer arrows denote higher 
Xi expression but expression is still not equal to the Xa. C: A 
gene may escape from XCI on certain X chromosomes but not 
others. To simplify, variable Xi expression levels are not shown in 
parts (C) and (D). D: Escape from XCI may occur only in certain 
tissues for some genes. E: The level of skewing of XCI, as 
illustrated by grey and white circles, differs between females and 
tissues. As escape from inactivation can differ between X 
chromosomes (shown in C), skewing will alter the overall 
expression level. 



studies examining non-clonal samples must take into account 
the level of skewing of XCI within each sample, which has not 
been done in all cases [31]. Studies using DNA methylation 
(DNAm) [32], which is acquired by the Xi but low to absent on 
the Xa, do not require clonal populations; however, studies 
of activity [23], or absence of active marks such as RNA 
polymerase II [33] must either use proven clonal cell lines or 
adjust for the extent of skewing of XCI. 

We have discussed escape from XCI in humans and mice; 
however, escape from XCI is observed in other species. There 
are genes for which escape from XCI is conserved across 
species; but in general, it appears that rodents have fewer 
genes that escape from XCI than other species examined [24] 
although there are also genes that escape XCI only in primates 
(.RPS4X, [34]). Characterizing the variability of escape from 
inactivation, whether between species, individuals, tissues 
or genes, will inform our understanding of how inactivation 
spreads to most, but not all, of the genes on the 160 Mb 
X chromosome. 



"Escape" genes differ from "subject" 
genes in their associated chromatin 
features 

In addition to observing a continuum of expression from 
the Xi, examination of allelic imbalance of histone modifica- 
tions demonstrated a gradation of histone modifications 
on genes subject to or escaping from XCI [23]. More and more 
studies have revealed that chromatin modifications not only 
differ between the Xa and Xi, but that genes that escape from 
XCI show different patterns than genes subject to XCI on the Xi 
(see Table 2). The ability of inducible transgenes of XIST/Xist 
to trigger inactivation in humans [35] and mice [36], suggests 
that the XIST RNA is the initiating signal for the assembly 
of the facultative heterochromatin on the Xi, and a lack of 
Xist RNA has been noted on some genes that escape from 
XCI [37-39]. Whether this deficiency reflects an initial lack of 
spreading to these areas or whether it is actively cleared or 
passively lost at a later point in time remains unanswered, 
as there are suggestions that the initial spread of Xist and 
silencing may be more extensive [27, 40]. 

Other modifications found to be associated with the Xi 
include the loss of active chromatin marks such as acetylation 
of histones and H3K4me3, as well as the gain of repressive 
histone marks including the polycomb repressive complex 
(PRC)-l-mediated H2A ubiquitination, H3K9me2/3 and the 
PRC2-mediated H3K27me3 (reviewed in [41]). Genomic regions 
of H3K27me3 and XIST were reported to recruit SMCHDl, 
which in turn interacts with the HPl-binding protein HBiXl 
from the distinct H3K9me3 domain, bridging the two domains 
and working together to create the compact Xi structure [42], 
which can then recruit further marks such as DNAm. In 
addition, enrichment of the variant histones MacroH2Al 
and MacroH2A2 was noted on the Xi [43], and other histone 
variants showed varying levels of enrichment or deple- 
tion [44]. Analysed genes that escape from XCI appear to have 
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Table 2. Summary of chromatin marks distinguishing genes subject to and escaping from XCI 



(0 
(0 

w 

■> 

0 



Feature 


Subject (mouse 
and human) 


Escape (mouse) 


Escape (human) 


XIST/Xist coating 


Present [96-99] 


Depleted KdmSc, 
Kdm6a [37] ''[38, 39]" 




H3K4 methylation 


Depleted [100-103] 


Present [45] " 


Present UBA1 [46]'' [23, 45]" 


H3 and H4 acetylation 


Depleted [100, 104, 105] 


Present KdmSc, 
Eif2s3x [76]^[104]'' 


Present EIF2SI3 [76]''X/Sr, 
ZFX, KDM5C [^0^^ M 
UBA1 [46] "[23]" 


H3K27 methylation 


Present [103, 106] 


Depleted [20] " 


Depleted UBA 1 [46] " [23] " _ 


H2A ubiquitination 


Present [1 07-1 09] 






H3K9 methylation 


Present [100, 101, 110-112] 




Depleted UB/A 7 [46]" H 


H4K20 methylation 


Present [113] 




Depleted l/B/\ 7 [46]" J 


MacroH2A 


Present [43, 114-116] 


Depleted Xist, Eif2s3x, 
Kdm5c, Midi [47]'' 




Replication timing 


Late [48] 




Early [49, 50] " 


DNAm of CpG islands 


Present [117-120] 


Depleted KdmSc [76] ^ 


Depleted /W/C2 [121]"; fl 


in promoter region 






T/MP 7 [122]" UB/A 7, ■ 
PCTM [123]"[32, 60]" 1 



Features seen in genes subject to XCI are consistent between mouse and human, and are grouped together; while features associated with 
genes that escape from XCI are separate for mouse and human as several of the marks remain untested in one or the other species. 
References for escape features are specified based on the number of escape genes/domains studied (see footnote). Genes studied for 
features conferring XCI are not indicated. Alternate gene names are listed in Table 1, in addition UBA1 is also known as UBE1. 
"Studies examined specific genes (<5) that escape from XCI, gene names are listed. 
''Studies examined a group or domain of genes (>5) that escape from XCI. 



euchromatic histone features, while demonstrating depletion 
in the repressive marks [20, 23, 45, 46] and loss of MacroH2A 
recruitment compared to subject domains [47]. 

Late replication was an early feature associated with the 
Xi [48], and recent genome-wide assessment of replication 
timing has demonstrated that replication of the Xi occurs in an 
unstructured fashion in which origins fire randomly and 
replication finishes quickly despite its lagging start [49]. There 
was a significant exception to the randomness seen in an ~8 Mb 
region on the distal short arm of the X chromosome, which 
replicated relatively early and with similar timing and structure 
for the homologous chromosome pairs in both females (Xa and 
Xi) and males (Xa and Y) [49]. Interestingly, this region contains 
the PARI, which is actively expressed on both the Xi and Y, as 
well as a cluster of genes that escape from XCI in females [17]. 
In agreement with these findings, early studies of replication 
timing and transcriptional activity on the X chromosome 
showed evidence of earlier replication of putative escape 
regions [50], or of reactivated genes on the Xi (e.g. [51]), 
although the Xa and Xi have been shown to use the same 
origins of replication [52]. Advances to these high-resolution 
chromosome-wide methods will allow the closer examination of 
the replication timing of individual genes that escape from XCI. 

DNA methylation is an indicator of XCI 
status in eutherians 

DNAm was another epigenetic mark proposed quite early to 
be associated with XCI [53], and unlike the majority of CpG 
islands in the genome, CpG islands at the promoters of genes 
subject to XCI are heavily methylated on the Xi, with the 
exception of genes that escape from XCI, which tend to be 



hypomethylated (e.g. [54]). Interestingly, the CpG island 
promoters of X-linked genes are biased against the strongest 
class of CpG island promoters, which were generally 
associated with broadly expressed housekeeping genes and 
strong GC skew, suggesting that a high GC content was 
protective against DNAm. However, no correlation between 
CpG island promoter class and the ability of a gene to escape 
from XCI was observed [55]. 

In contrast to mice and humans, the Xi of marsupials lacks 
expression of an Xist homologue and DNAm, with only one 
X-linked gene, Rsx, identified as differentially methylated 
between the Xa and Xi [56]. Rsx was recently described as 
a large non-coding, repeat-rich gene sharing the Xist-like 
property of coating the Xi, and is capable of silencing in cis 
when integrated into mouse ES cells [57]. RNA-seq revealed 
that ^W/o of genes escape from XCI in marsupials, but the 
corresponding hypomethylation of the Xi means that DNAm 
is not a distinguishing factor between genes subject to and 
escaping from XCI in this species. However, a significant 
decrease in repressive H3K27me3 at genes that escape from 
XCI suggested that histone modifications still correlate with 
marsupial XCI [56], and may highlight an important difference 
in escape mechanisms between species. 

DNAm has generally been considered to be a late event 
in XCI, and is often cited as a maintenance mechanism to 
lock in silencing; closer analyses of several X-linked CpG 
islands across the chromosome revealed that DNAm is 
actually established in two separate waves [58]. In addition 
to the Smchdl-dependent pathway that is acquired gradu- 
ally over an extended period [59], Smchdl-independent 
DNAm occurs more rapidly after onset of XCI [58]. Given that 
XCI marks such as DNAm are currently being used as tests 
for predicting the Xi status of an X-linked gene (e.g. [60]), 
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timing of their recruitment to the Xi as well as the 
mechanism of their establishment are important processes 
to be kept in mind. 

Many of the features tested so far have correlated well with 
expression data, as in mouse, particularly mouse ES cells, 
it has been possible to determine when different features 
become visible on the Xi, and whether they are lacldng in 
genes that escape from XCI. There is redundancy as well as co- 
operativity between many of the marks of XCI, since removal 
of a single silencing feature does not lead to complete 
reactivation of the entire X chromosome as demonstrated 
in several studies [61-66]. Understanding the interplay of 
heterochromatin formation on the Xi and the synergy of marks 
would provide insight into the silencing pathways, and 
perhaps offer clues to the right combination of marks needed 
to tip the scale to favour escape from XCI. 

DNA sequences associated with escape 
from XCI 

The consistent ability of some genes to avoid the silencing and 
associated heterochromatic features of XCI suggests that 
the genes' neighbourhood is an important contributor to 
expression. The analysis of spread of silencing into autosomal 
material had suggested that there might be waystations, 
which Lyon proposed to be LINEs [67]. Recently, using high- 
density arrays to profile DNAm as a mark of silencing, spread 
into autosomal material was examined at a high resolution. 
Cotton et al. found an enrichment of Lis within 100 kb of 
genes subject to XCI; however the enrichment of Alu elements 
in gene regions escaping from XCI was more significant [68]. 
A similar study of six different translocations also observed 
an enrichment of Alu, as well as simple repeats, around 
genes that escape from XCI [69], and that LI and L2 classes 
of LINEs were enriched around genes subject to XCI in most 
translocations, the most consistently enriched short motif 
being derived from recently active LI elements. An enrichment 
for the younger LI subclasses on the X chromosome in 
general - and around genes known to escape from XCI - had 
been noted earlier [70]. Using a larger catalogue of genes 
that escape from XCI [17] an enrichment for Alu and short 
ACG/CGT motifs was observed around genes that escape from 
XCI, while a number of repeat features were enriched around 
genes that were subject to XCI, with LI and L2 having the 
greatest discriminatory power to predict whether a gene 
was subject to XCI [71]. In mice, comparison of a gene that is 
X-linked in Mus spretus and autosomal in Mus musculus 
demonstrated enrichment of an AT motif on the X-linked 
version that was subject to XCI [72]. Several mouse protein 
coding genes which escape from XCI have been found to be 
located in close proximity to long non-coding RNAs, which 
escape from XCI suggesting that long non-coding RNAs 
may also play a role in determining escape from XCI [73]. 
A subset of young LI elements were shown to be expressed 
during XCI in differentiating mouse ES cells, and enriched at 
the boundaries of escape regions, where they were suggested 
to be facilitating the spread of XCI [74]. 

The assessment of the DNA sequences flanking genes 
that resist inactivation has given clues to the potential 



identity of putative waystations. In contrast, recurrent 
integration of a BAC containing a mouse gene that escaped 
from XCI provided evidence supporting intrinsic elements 
that promote expression from the otherwise silent Xi [75]. 
Comparisons of human and mouse regions escaping XCI have 
suggested a role for CTCF in boundary regions between escape 
and subject [76], although it has generally been considered 
that these boundaries are preventing spread of silencing, 
rather than spread of escape. Interestingly, a deletion of a 
boundary in the transgene that demonstrated intrinsic escape 
from XCI led to spread of escape into adjacent genes [77], 
suggesting that boundary elements may also block spread of 
euchromatin. 

While these studies have shown consistent correlation of 
elements, notably Ll/2 with subject and Alu with escape, there 
has not been clear identification of a waystation or other 
element. Overall, we need to dig deeper into the structure of 
the X chromosome and the process of XCI to understand what 
differs between the genes that are consistently subject to XCI 
and those that escape from XCI to various extents; but we will 
need to bear in mind that the genomic distributions of many 
DNA elements are likely not independent of each other. 

Tliere is a step-wise spreading of Xist 
RNA across the X 

Genes that escape from XCI have been noted to be depleted in 
Xist [37], raising the question of how Xist RNA spread occurs 
temporally and spatially. In order to address how the Xist 
RNA 'coats' the ~150Mb chromosome researchers have 
utilised biotinylated antisense probes to isolate DNA regions 
interacting with the noncoding RNA. Engreitz et al. [39] 
showed that the initial localisation of Xist was to inactive 
genes in gene-rich domains that were in close proximity to 
the Xist gene as determined by chromosome conformation 
studies [78]. The regions with the highest Xist enrichment 
also had higher H3K27me3, and as XCI proceeded the Xist 
localisation spread to active gene-rich regions. Intriguingly, 
while the initial localisation was not silencing-dependent, 
the spread to actively transcribed regions required a 
transcriptionally competent Xist RNA [39]. The elegance 
of these studies demonstrated the power of using mouse ES 
cells in which transgenes and knockouts can be created and 
followed over the initial stages of XCI. Interestingly, only four 
of the 13 consistent genes that escape from XCI from Table 1 
were found on the list of 53 genes noted to be depleted for 
Xist, suggesting that lack of Xist interaction is not the sole 
source of escape from XCI. 

With a related capture approach, Simon et al. [38] 
observed a two-step spread of Xist. Once again the first 
localisation was detectable at gene-rich islands, before 
spreading to the intervening gene-poor regions [38]. Depletion 
of Xist from the gene bodies of genes that escape from XCI 
was observed, while the earliest domains to recruit Xist 
were active gene-rich domains, and as generally seen for 
active regions, correlated with an enrichment for short 
interspersed nuclear elements (SINEs) and reduction in 
LINEs, early replication and DNase I hypersensitivity. A 
modest enrichment for chromosome conformation-defined 
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Figure 2. Features contributing to escape from XCI. 
A: Srs was the first non-PAR gene found to escape 
from XCI in humans, and maps of genes escaping 
XCI show that genes with the least divergence from 
the Y are most likely to escape from XCI. B: Genes, 
which escape from XCI differ with respect to inactive 
(yellow hexagons) and active (green stars) chromatin 
marks as well as the presence of XIST RNA (blue 
wavy line) and promoter DNAm (white lollipops = 
unmethylated, black lollipops = methylated). C: DNA 
sequences such as waystations (pink triangles), 
escape elements (orange ovals) and boundary ele- 
ments (maroon hexagons) have been hypothesised 
to account for genes that are subject to and escape 
from XCI. D: The three dimensional structure of the 
Xi appears to bring together genes that escape from 
XCI and to involve XIST (blue wavy lines) and PRC2 
(tan ovals) in the spread of XCI. E: Together all the 
above features influence whether a gene is subject 
to, or can escape from, XCI. There does not appear 
to be a definitive set of features that cause a gene 
to escape from XCI, rather, it is likely a combination 
of multiple features that determines the degree to 
which escape occurs. 
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contacts was again observed. Spreading of XCI was also 
suggested to occur through a hierarchy of PRC2 binding sites. 
Allele-specific PRC2 binding was examined during ES cell 
differentiation and ~150 initial strong sites of enrichment 
were observed [79], which were concentrated at bivalent 
ES cell domains coinciding with CpG islands. Interestingly, 
these strong PRC2 sites did not definitively correlate with 
Xist interaction [38], although the subsequent spread to ~400 
PRC2 sites did show a correlation with Xist binding. A clear 
correlation was observed across differentiation between 



PRC2 and H3K27me3 enrichment [38], with targeting of 
H3K27me3 to active regions, and subsequent loss of H3K4me3 
being more important in the maintenance of XCI in somatic 
cells [80]. While direct interaction of the Xist repeat A region 
with PRC2 has been reported [81], more recently spatial 
separation between Xist and PRC2 has been observed along 
the Xi [82], and Jarid2 has been implicated as a mediator of 
PRC2 recruitment by Xist [83]. 

Initial tethering of Xist has been shown to involve binding 
of both the RNA and the DNA by the transcription factor 
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YYl [84]. From this foothold, the Xist RNA spreads to target 
closely interacting gene-rich regions and recruits a hierarchy 
of PRC2 sites that in turn establishes H3K27me3 and silencing, 
followed by spread of Xist and H3K27me3 more fully across the 
Xi. An unsolved question is how Xist itself manages to avoid 
being silenced. The spreading of Xist leads to one of the 
earliest events in XCI, the creation of a silent nuclear 
compartment, depleted of RNA polymerase II and transcrip- 
tion. This inner compartment of the Xi was predicted to consist 
mainly of silent repeats both in mice [85] and in humans [86], 
with genes located more to the periphery; however upon 
silencing genes appeared to be drawn into this condensed 
Xist-dense core. Recently, repeat-rich stable nuclear RNAs 
have been found to be associated with the scaffold of 
euchromatic chromosomes [87]. Such RNAs are excluded from 
the Xi; however, they are reminiscent of the XACT RNA 
association with the Xa early in development [10], potentially 
demonstrating an interplay between non-coding RNAs, 
chromatin remodelling and the three-dimensional topology 
of the chromosome. 



Conformation of the Xi differs from 
the Xa 

Chromosomes have long been seen to have different locations 
in the nucleus, and the Xi is a striking example, being 
preferentially found at the nuclear or nucleolar periphery [88, 
89]. Targeting of the Xi, or even Xi'c-containing transgenes 
to the SNF-rich perinucleolar region was critical for the 
maintenance of silencing [88]. Recent advances in molecular 
approaches to dissect the subchromosomal three-dimensional 
structure of DNA within the nucleus by examining the 
intranuclear conformation of chromosomes involve capturing 
contacts between distant regions of the genome with a cross- 
linking technique. Interrogation of these contacts has been 
accomplished through a variety of chromosome conformation 
approaches (reviewed in [90, 91]) revealing a separation into 
topological domains of different sizes. 

Analysis of chromosome conformation around the Xic in 
undifferentiated and differentiated mouse ES cells revealed 
topologically associating domains (TADs), which aligned with 
H3K27me3 and H3K9me2 [92]. Boundaries between alternate 
topological domains were enriched for CTCF, housekeeping 
genes, tRNAs and SINEs [78]. Within the Xic region, the 
boundaries were often observed to contain CTCF, however 
CTCF sites are also seen within TADs, implicating additional 
features for functionality of boundaries, the deletion of which 
resulted in disruption of a TAD [92]. In differentiated cells, 
global organisation into TADs continued on the Xa, but 
the Xi lost most long-range contacts. This more random 
organisation of the Xi was consistent with the limited 
interactions observed with an allele-specific chromosome 
conformation approach [93] ; however the genes that escaped 
from XCI also participated in long range contacts with each 
other. Such contacts suggested that 10 more genes might 
escape from XCI; however only two of these were found in 
the additional studies reviewed in Table 1 [22]. While such 
extensive studies have not been performed in humans, 
analysis of the spread of XCI into autosomal material showed 



that genes that were subject to (or escaped from) XCI clustered 
within TADs, and genes subject to XCI were more likely to 
be found in regions that have PRC2 and H3K27me3 marks 
normally on non-rearranged chromosomes [68]. 

Since Lyon first suggested that genes with Y homologues 
would escape from XCI in 1962, more complete maps of the 
genes that escape from XCI have been generated (Fig. 2A), and 
many of the players in the process of XCI and marks that are 
assembled onto silent genes have been identified (Fig. 2B). 
Models have theorised that there would be waystations, 
boundary and escape elements involved in XCI (Fig. 2C; 
reviewed in [94]), and to date, multiple elements have been 
correlated with either genes that escape from or are subject to 
XCI. Elucidation of the three-dimensional structure of the 
Xi has yielded new insights into the spread of XCI and the 
interactions between genes that escape from XCI (Fig. 2D). 
Despite the continuing progress in generating a comprehen- 
sive model of XCI, the lack of a single feature whose presence 
or absence is necessary for a gene to escape from XCI suggests 
that the variability that we observe is a reflection of multiple 
contributions to each gene or domain's activity (see Fig. 2E). 
Given the complexity of the silencing process, and the 
reported independence of many of the features from each 
other (e.g. [82, 95]), or from silencing [41], it would not be 
surprising that a gene's expression from the Xi is impacted by 
more than one of the features we have discussed in this 
review. 



Conclusions and prospects 

Improvements to the catalogues of genes that escape from XCI 
are still possible as genome-wide methodologies increase the 
depth of coverage. With these improved catalogues of XCI 
status, we may be able to refine the correlations with specific 
features. Overall, an approach to test specific regions will 
substantially further our understanding of the process of 
escape from XCI. A promising approach in recent studies has 
been the integration of over 100 kb sized BAC constructs to 
delineate the smallest regions necessary for silencing or 
escape. These have involved both multiple integrations of one 
construct into different locations [75] or integration of multiple 
constructs into one site on the X chromosome [13]. Similar to 
the spread onto autosomes, these studies are not as biased by 
the evolutionary history of the X chromosome; however they 
have the advantage of still studying the spread of silencing on 
the Xi. Mechanistic studies in humans have been limited 
because of the lack of a developmental model; however, a 
recent report of a human XIST transgene in an induced 
pluripotent stem cell may provide a means around that 
challenge [35]. New methodologies may therefore tip the 
balance towards elucidation of the complementary roles that 
DNA sequences, chromatin modifications and chromosomal 
domains play in the variable expression from the Xi in both 
humans and mice. 
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